performing a cluster analysis along the selected feature to group the data into one or more
clusters; 

constructing one or more arcs of the decision tree at the node respectively for each of the one
or more clusters; 

projecting the data in each of the clusters, wherein the projected data are characterized by the
plurality of the features but for the selected feature; and

recursively performing the steps of selecting a feature and performing the cluster analysis on
the projected data in each of the clusters.
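
For illustration only, the recursion recited above (select a feature, cluster along it, construct one arc per cluster, project out the selected feature, recurse) can be sketched as follows. This is a minimal sketch under stated assumptions, not the claimed implementation: the names `build_tree`, `first_feature`, and `split_at_mean`, the dictionary-based tree layout, and the `"id"` key are all hypothetical, and the two helper functions are simple stand-ins for whatever feature selection and cluster analysis are actually used.

```python
from collections import defaultdict


def build_tree(rows, features, select_feature, cluster):
    """Recursively refine a node (hypothetical layout: nested dicts).

    rows     : list of dicts, each with an "id" key plus one key per feature
    features : names of the features still characterizing the data
    """
    if not features or len(rows) <= 1:
        return {"leaf": [row["id"] for row in rows]}

    feature = select_feature(rows, features)
    # One cluster label per row, computed along the selected feature only.
    labels = cluster([row[feature] for row in rows])

    groups = defaultdict(list)
    for row, label in zip(rows, labels):
        # Projection: keep every key except the selected feature.
        groups[label].append({k: v for k, v in row.items() if k != feature})

    remaining = [f for f in features if f != feature]
    # One arc per cluster; each arc leads to a recursively built subtree.
    return {
        "feature": feature,
        "arcs": {
            label: build_tree(group, remaining, select_feature, cluster)
            for label, group in groups.items()
        },
    }


def first_feature(rows, features):
    """Stand-in selector (assumption): always pick the first remaining feature."""
    return features[0]


def split_at_mean(values):
    """Stand-in hard clustering (assumption): two groups split at the mean."""
    mean = sum(values) / len(values)
    return [0 if v <= mean else 1 for v in values]
```

Calling `build_tree(rows, ["x", "y"], first_feature, split_at_mean)` on a small list of rows returns a nested dictionary whose `"arcs"` entries correspond to the clusters found at each node. The recursion terminates because each level removes one feature from the projected data.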

2. (Original) The method according to claim 1, wherein the step of selecting the feature
includes the steps of: 

performing a plurality of cluster analyses along each of the features to calculate a maximal
cluster validity measure, said maximal cluster validity measure corresponding to one of
the features; and 

selecting the one of the features that corresponds to the maximal cluster validity measure. 



3. (Original) The method according to claim 2, wherein the step of performing a plurality of
cluster analyses along each of the features to calculate a maximal cluster validity measure
includes the performing the steps of:

for each of the features, performing a plurality of cluster analyses along said each of the
features for a plurality of cluster numbers to calculate respective partition coefficients;
and 

determining the maximal cluster validity measure from among the partition coefficients.
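
As a hedged illustration of the steps above, the sketch below runs a one-dimensional fuzzy c-means analysis along each feature for several cluster numbers, computes a partition coefficient for each run, and picks the feature with the maximal value. Several points are assumptions not stated in the claims: the partition coefficient is taken to be Bezdek's definition (the mean of the squared memberships), the fuzzifier `m` is 2, the cluster numbers tried are 2 and 3, and the function names and the evenly spaced center initialization are hypothetical.

```python
def memberships(values, centers, m=2.0):
    """Fuzzy c-means membership matrix u[i][k] for 1-D values (requires m > 1)."""
    power = 2.0 / (m - 1.0)
    u = []
    for x in values:
        dists = [abs(x - c) for c in centers]
        zeros = [k for k, d in enumerate(dists) if d == 0.0]
        if zeros:
            # Point coincides with one or more centers: share full membership.
            row = [1.0 / len(zeros) if k in zeros else 0.0
                   for k in range(len(centers))]
        else:
            row = [1.0 / sum((dk / dj) ** power for dj in dists) for dk in dists]
        u.append(row)
    return u


def fuzzy_c_means_1d(values, c, m=2.0, iters=100, tol=1e-9):
    """Return the membership matrix after alternating center/membership updates."""
    lo, hi = min(values), max(values)
    # Assumed deterministic start: centers evenly spaced across the domain.
    centers = [lo + (hi - lo) * (k + 0.5) / c for k in range(c)]
    for _ in range(iters):
        u = memberships(values, centers, m)
        new_centers = []
        for k in range(c):
            weights = [row[k] ** m for row in u]
            total = sum(weights)
            if total > 0.0:
                new_centers.append(
                    sum(w * x for w, x in zip(weights, values)) / total)
            else:
                new_centers.append(centers[k])  # keep a center with no support
        shift = max(abs(a - b) for a, b in zip(centers, new_centers))
        centers = new_centers
        if shift < tol:
            break
    return memberships(values, centers, m)


def partition_coefficient(u):
    """Assumed definition (Bezdek): mean over data points of summed squared memberships."""
    return sum(v * v for row in u for v in row) / len(u)


def select_feature(columns, cluster_numbers=(2, 3)):
    """columns maps feature name -> list of values; returns (feature, c, pc)."""
    best = None
    for feature, values in columns.items():
        for c in cluster_numbers:
            pc = partition_coefficient(fuzzy_c_means_1d(values, c))
            if best is None or pc > best[2]:
                best = (feature, c, pc)
    return best
```

Under this definition the coefficient approaches 1 when memberships are nearly crisp, so a feature whose values fall into well-separated groups is preferred over one whose values are spread evenly.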

4. (Original) The method according to claim 1, wherein the step of performing the cluster
analysis includes the step of performing a fuzzy cluster analysis. 

5. (Original) The method according to claim 4, wherein the step of performing the fuzzy
cluster analysis includes the step of performing a fuzzy c-means analysis.

6. (Original) The method according to claim 1, wherein the step of performing the cluster 
analysis includes the step of performing a hard cluster analysis.

7. (Original) The method according to claim 1, wherein the step of performing the cluster 
analysis along the selected feature to group the data into one or more clusters includes the steps
of: 

calculating a domain ratio of a difference in domains limits of the data over a difference in
domain limits of a superset of the data;

determining whether the domain ratio has a predetermined relationship with a predetermined
threshold; and

if the domain ratio has the predetermined relationship with the predetermined threshold, then
grouping the data into a single cluster.

8. (Original) The method according to claim 7, wherein the step of determining whether the
domain ratio has the predetermined relationship with the predetermined threshold includes the
step of determining whether the domain ratio is less than the predetermined threshold. 
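
The domain-ratio test described above can be sketched as follows, purely as an illustration. The function names, the default threshold of 0.1, and the handling of a zero-width superset are assumptions; the claims specify only that the ratio is compared with a predetermined threshold and that a ratio below the threshold yields a single cluster.

```python
def domain_ratio(data, superset):
    """Width of the data's domain divided by the width of the superset's domain."""
    data_width = max(data) - min(data)
    superset_width = max(superset) - min(superset)
    if superset_width == 0:
        # Assumption: a zero-width superset means the data cannot be split further.
        return 0.0
    return data_width / superset_width


def is_single_cluster(data, superset, threshold=0.1):
    """True when the data span so little of the superset's domain that they form one cluster.

    The default threshold of 0.1 is a hypothetical value chosen for illustration.
    """
    return domain_ratio(data, superset) < threshold
```

For example, values spanning 4.0 to 4.5 within a superset spanning 0.0 to 10.0 give a ratio of 0.05, which falls below the assumed threshold, so the values would be grouped into a single cluster without further analysis.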



9. (Canceled)

10. (Currently Amended) A method for generating a decision tree for a plurality of data
characterized by a plurality of features, comprising:

performing a plurality of cluster analyses along each of the features to calculate a maximal 
cluster validity measure, said maximal cluster validity measure corresponding to one of
the features;

selecting the one of the features corresponding to the maximal cluster validity measure; 

subdividing the data into one or more groups based on the selected feature; and

building the decision tree based on the one or more groups, 

wherein the step of performing the cluster analyses along each of the features to calculate a
maximal cluster validity measure includes the performing the steps of:

for each of the features, performing a plurality of cluster analyses along said each of the
features for a plurality of cluster numbers to calculate respective partition coefficients;
and

determining the maximal cluster validity measure from among the partition coefficients.

11. (Canceled)

12. (Original) The method according to claim 10, wherein the step of performing the cluster 
analyses includes the step of performing a plurality of fuzzy cluster analyses.



13. (Original) The method according to claim 10, wherein the step of performing the fuzzy 
cluster analyses includes the step of performing a plurality of fuzzy c-means analyses.

14. (Original) The method according to claim 10, wherein the step of performing the cluster
analyses includes the step of performing a plurality of hard cluster analyses. 

15. (Original) The method according to claim 10, wherein the step of performing the cluster 
analyses includes the steps of: 

calculating a domain ratio of a difference in domains limits of the data over a difference in
domain limits of a superset of the data;

determining whether the domain ratio has a predetermined relationship with a predetermined
threshold; and

if the domain ratio has the predetermined relationship with the predetermined threshold, then 
grouping the data into a single cluster. 

16. (Original) The method according to claim 10, wherein building the decision tree based 
on the one or more groups includes the steps of: 

projecting the data in each of the groups, wherein the projected data are characterized by the
plurality of the features but for the selected feature; and

recursively performing the steps of selecting a feature, comprising selecting a new one of the
features corresponding to a new maximal partition coefficient and subdividing the data
into one or more new groups based on the selected new feature.

17. (Original) A method for generating a decision tree for a plurality of data characterized by
a plurality of features, comprising: 

performing a plurality of fuzzy cluster analyses along each of the features to calculate a
maximal partition coefficient and a corresponding set of one or more fuzzy clusters, said
maximal partition coefficient corresponding to one of the features;
selecting the one of the features corresponding to the maximal partition coefficient; and 
building the decision tree based on the corresponding set of one or more fuzzy clusters. 

18. (Currently Amended) A computer-readable medium bearing instructions for refining a 
node of a decision tree associated with a plurality of data characterized by a plurality of features,
said instructions being arranged to cause one or more processors upon execution thereby to
perform the steps of: 

selecting a feature from among the features characterizing the data associated with the node;
performing a cluster analysis along the selected feature to group the data into one or more
clusters; 

constructing one or more arcs of the decision tree at the node respectively for each of the one
or more clusters;

projecting the data in each of the clusters, wherein the projected data are characterized by the
plurality of the features but for the selected feature; and

recursively performing the steps of selecting a feature and performing the cluster analysis on
the projected data in each of the clusters.

19. (Original) The computer-readable medium according to claim 18, wherein the step of
selecting the feature includes the steps of: 

performing a plurality of cluster analyses along each of the features to calculate a maximal
cluster validity measure, said maximal cluster validity measure corresponding to one of
the features; and 

selecting the one of the features that corresponds to the maximal cluster validity measure. 

20. (Original) The computer-readable medium according to claim 19, wherein the step of 
performing a plurality of cluster analyses along each of the features to calculate a maximal
cluster validity measure includes the performing the steps of: 

for each of the features, performing a plurality of cluster analyses along said each of the 
features for a plurality of cluster numbers to calculate respective partition coefficients;
and 

determining the maximal cluster validity measure from among the partition coefficients.

21. (Original) The computer-readable medium according to claim 18, wherein the step of
performing the cluster analysis includes the step of performing a fuzzy cluster analysis. 

22. (Original) The computer-readable medium according to claim 21, wherein the step of 
performing the fuzzy cluster analysis includes the step of performing a fuzzy c-means analysis.



23. (Original) The computer-readable medium according to claim 18, wherein the step of 
performing the cluster analysis includes the step of performing a hard cluster analysis.



09/553,956 



Patent 



24. (Original) The computer-readable medium according to claim 18, wherein the step of
performing the cluster analysis along the selected feature to group the data into one or more 
clusters includes the steps of: 

calculating a domain ratio of a difference in domains limits of the data over a difference in
domain limits of a superset of the data;

determining whether the domain ratio has a predetermined relationship with a predetermined
threshold; and

if the domain ratio has the predetermined relationship with the predetermined threshold, then 
grouping the data into a single cluster. 




25. (Original) The computer-readable medium according to claim 24, wherein the step of
determining whether the domain ratio has the predetermined relationship with the predetermined
threshold includes the step of determining whether the domain ratio is less than the
predetermined threshold. 



26. (Canceled) 



27. (Currently Amended) A computer-readable medium bearing instructions for generating a 
decision tree for a plurality of data characterized by a plurality of features, said instructions being 
arranged to cause one or more processors upon execution thereby to perform the steps of:

performing a plurality of cluster analyses along each of the features to calculate a maximal
cluster validity measure, said maximal cluster validity measure corresponding to one of
the features;

selecting the one of the features corresponding to the maximal cluster validity measure; 

subdividing the data into one or more groups based on the selected feature; and
building the decision tree based on the one or more groups, 

wherein the step of performing the cluster analyses along each of the features to calculate a
maximal cluster validity measure includes the performing the steps of:

for each of the features, performing a plurality of cluster analyses along said each of the
features for a plurality of cluster numbers to calculate respective partition coefficients;
and

determining the maximal cluster validity measure from among the partition coefficients.

28. (Canceled) 

29. (Original) The computer-readable medium according to claim 27, wherein the step of
performing the cluster analyses includes the step of performing a plurality of fuzzy cluster 
analyses. 

30. (Original) The computer-readable medium according to claim 27, wherein the step of 
performing the fuzzy cluster analyses includes the step of performing a plurality of fuzzy
c-means analyses.



31. (Original) The computer-readable medium according to claim 27, wherein the step of 
performing the cluster analyses includes the step of performing a plurality of hard cluster
analyses. 

32. (Original) The computer-readable medium according to claim 27, wherein the step of
performing the cluster analyses includes the steps of: 

calculating a domain ratio of a difference in domains limits of the data over a difference in
domain limits of a superset of the data;

determining whether the domain ratio has a predetermined relationship with a predetermined
threshold; and

if the domain ratio has the predetermined relationship with the predetermined threshold, then 
grouping the data into a single cluster. 

33. (Original) The computer-readable medium according to claim 27, wherein building the 
decision tree based on the one or more groups includes the steps of: 

projecting the data in each of the groups, wherein the projected data are characterized by the
plurality of the features but for the selected feature; and

recursively performing the steps of selecting a feature, comprising selecting a new one of the
features corresponding to a new maximal partition coefficient and subdividing the data
into one or more new groups based on the selected new feature.

34. (Original) A computer-readable medium bearing instructions for generating a decision 
tree for a plurality of data characterized by a plurality of features, said instructions being arranged
to cause one or more processors upon execution thereby to perform the steps of: 

performing a plurality of fuzzy cluster analyses along each of the features to calculate a 
maximal partition coefficient and a corresponding set of one or more fuzzy clusters, said 
maximal partition coefficient corresponding to one of the features;
selecting the one of the features corresponding to the maximal partition coefficient; and
building the decision tree based on the corresponding set of one or more fuzzy clusters.